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SOME  LOCALLY  OPTIMAL  SUBSET  SELECTION  RULES 
FOR  COMPARISON  WITH  A  CONTROL* 
by 

Deng-Yuan  Huang,  National  Taiwan  Normal  University 
S.  Panchapakesan,  Southern  Illinois  University 

and 

Sheng-Tsaing  Tseng,  National  Taiwan  Institute  of  Technology 

1.  INTRODUCTION 

Let  ff0 ,1rl  *  *  *  ‘  ,irk  be  k  +  1  independent  populations  where  has  the 
associated  distribution  function  F(x,e^)  and  density  f(x,ei- )  with  the  un¬ 
known  parameter  belonging  to  an  Interval  ©  of  the  real  line  R.  We  are 
generally  interested  in  two  types  of  goals.  Goal  I  is  to  select  a  sub¬ 
set  (preferably  small  in  size)  of  the  k  populations  ir-j,...,^  that  will 
contain  the  best  (suitably  defined)  among  them  and  Goal  II  is  to  select 
from  ir^,...,irk  (k  experimental  treatments)  those  populations,  if  any, 
that  are  better  (to  be  defined)  than  7rQ  which  is  the  control  or  standard 
population. 

In  the  recent  years,  several  authors  have  investigated  construction 
of  optimal  subset  selection  rules  and  also  established  optimality  prop¬ 
erties  of  known  selection  rules  for  specific  cases.  Some  of  the  impor¬ 
tant  papers  In  these  directions  are  Berger  and  Gupta  [1],  Bickel  and 
Yahav  [2],  Bjdrnstad  [3],  Chemoff  and  Yahav  [4],  Goel  and  Rubin  [5], 
Gupta  and  Hsiao  [6],  Gupta  and  Hsu  [7],  Gupta  and  Huang  [8,9],  Gupta 
and  Kim  [12,13],  Gupta  and  Mlescke  [14],  and  Mlescke  [17].  These  In¬ 
vestigations  generally  deal  with  the  symmetric  case  of  equal  sample 

*The  research  of  the  first  two  authors  was  supported  by  the  Office  of 
Naval  Research  Contract  N00014-75-C-0455  at  Purdue  University.  Repro¬ 
duction  in  whole  or  In  part  1$  permitted  for  any  purpose  of  the  United 
States  Government. 


sizes.  There  have  been  some  investigations  in  the  unequal  sample  sizes 
case  but  these  $re  concerned  with  ad  hoc  and  heuristic  procedures  and 
are  not  generally  successful  in  establishing  the  least  favorable  configu- 
ation  (LFC)  for  the  probability  of  a  correct  decision.  For  many  classical 
procedures  in  the  literature  for  selecting  a  subset  containing  the  best, 
the  LFC  turns  out  to  be  =  ...  =  e^.  This  provides  the  motivation  for 
seeking  rules  which  are  optimal  in  some  suitable  sense  in  a  neighborhood 
of  every  equiparameter  point.  When  we  are  comparing  these  populations 
with  ttq,  the  local  optimality  is  related  to  the  ability  to  reject  the 
populations  that  are  inferior  to  itq  and  select  those  that  are  superior 
in  a  suitable  neighborhood  of  selected  parametric  configurations. 

Though  some  early  Investigation  of  locally  optimal  subset  selection 
rules  based  on  ranks  appeared  in  Nagel  [18],  such  rules  were  not  investi¬ 
gated  further  until  recently.  Some  locally  optimal  subset  selection 
rules  based  on  ranks  were  derived  by  Gupta,  Huang  and  Nagel  [11]  and 
Huang  and  Panchapakesan  [16].  The  first  of  these  papers  considered  Goal 
I  whereas  the  latter  considered  both  Goals  I  and  II.  Though  these  rules 
were  based  on  ranks  It  was  assumed  that  the  functional  form  of  the  den¬ 
sity  is  known;  the  justification  for  seeking  rules  based  on  ranks  comes 
from  the  usual  robustness  considerations  in  that  the  ranks  are  Insensi¬ 
tive  to  outliers  and  there  could  be  possible  deviations  from  the  model. 
Gupta,  Huang  and  Nagel  [11]  maximized  the  probability  of  a  correct  selec¬ 
tion  in  the  neighborhood  of  any  equiparameter  point  whereas  Huang  and 
Panchapakesan  [16]  used  for  Goal  I  the  criterion  of  strong  monotonicity 
In  the  same  type  of  neighborhood  and  for  Goal  II  a  local  optimality  con¬ 
dition  which  reflects  the  sensitivity  of  the  rule  when  all  but  one  popu¬ 
lation  are  not  distinctly  superior  and  the  remaining  one  Is  close  to 
others  but  distinctly  superior. 
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The  present  paper  derives  a  locally  optimal  subset  selection  rule 
for  Goal  II  based  on  parametric  Inference  type  statistics  with  no  assump¬ 
tion  of  equal  sample  sizes.  In  Section  2,  we  state  a  local  optimality 
condition  and  prove  the  main  theorem  giving  the  construction  of  a  locally 
optimal  rule.  Section  3  deals  with  the  applications  of  the  general  re¬ 
sult  to  the  following  special  cases:  (a)  normal  means  comparison  -  common 
known  variance,  (b)  normal  means  comparison  -  common  unknown  variance, 

(c)  gamma  scale  parameters  comparison  -  known  (unequal)  shape  parameters 
and  (d)  regression  slopes.  In  all  these  cases,  the  locally  optimal  rule 
Is  obtained  based  on  samples  of  unequal  sizes. 

Finally,  for  detailed  discussions  on  optimality  and  other  aspects  of 
multiple  decision  problems  and  general  decision  theoretic  approach,  the 
reader  is  referred  to  Gupta  and  Huang  [10]  and  Gupta  and  Panchapakesan 
[15]. 


2.  DERIVATION  OF  THE  RULE 

Let  j=l,...,n^,  denote  the  random  sample  fromi^.,  1=0,l,...,k. 

Our  goal  Is  to  construct  a  rule  to  select  all  populations  that  are  better 
than  the  control.  The  selection  rule  will  depend  upon  the  observations 
through  the  statistics  T^q,  1*1, ...,k,  where  T^j  is  suitably  defined  to 
Indicate  the  difference  between  ir^  and  ttj.  For  fixed  n0,nlf...,nk,  we 
assume  that  T41  has  a  density  function  g  (t< J  depending  on  the  param- 

eter  x^.  The  parameter  is  a  measure  of  the  'separation'  of  from 
ttj.  Also,  Is  the  same  for  all  1  and  this  common  value  Is  denoted  by 
t*.  For  example,  In  the  location  case,  we  can  take  “  flj  Saving 

x*  *  0.  On  the  other  hand.  If  e  Is  a  scale  parameter,  we  can  take 

T1j  *  9l/°j  so  T*  *  1* 


Dlst 


Availability  Codes 
(Avail  and/or 
Special 


ft 


Now,  we  define  population  to  be  superior  to  the  control  If 
t10  >  t*  and  Inferior  otherwise.  Let 

Sl  *  •  •  •  »T|jq) ,  R,  1-1,. ...k}  , 

fl0  ■  <ilTio“-*-"TkO"T*}  ’ 
s  {ilTio>  T**TjO’  ' 

Define  6  =  Ur...,«k),  where  «1(l<i<fc)  Is  a  test  function  for  H0:t€&0 
vs.  H.:  t€£J^  . 


2.1.  Optimality  Requirement.  Let  0  <  y.  <1  (l<i<k)  be  specified. 


Def i ne 

(2.1) 


S  *  (£JET*(fit)“Yf .  1  <.1  <.k>  . 


We  wish  to  derive  a  rule  6€  S  which 


(2.2) 


maximizes  l  -r—~  E  (6.) 

1-1  3T10  i  1  T=T* 


among  all  the  rules  In  S.  We  note  that  «_€  S  means  that  the  error  prob¬ 
abilities  are  controlled  and  that  the  condition  (2.2)  amounts  to  maximiz¬ 
ing  the  efficiency  In  a  certain  sense  of  the  rule  In  picking  out  the 
superior  population  In  the  direction  of  each  component  at  t*  -  (t*,...,t* 
Let  hT(t)  denote  the  joint  density  of  T  *  (Tio* *  *  *  *TkO^  w1th  resPect 
to  a  o-flnlte  measure  u.  Let  hT*(t)  denote  the  density  hT(t)  when  t.  -  t* 
and  h^(t)  denote  the  partial  derivative  t-2—  h  (t)  evaluated  at  t  -  t* 

t  -  3Ti0  T  -  “  - 

Finally,  we  need  to  assume  certain  regularity  conditions,  namely,  that 
hT(t)  Is  continuously  differentiable  with  respect  to  each  component  of 
t  and  t*x(jt)  is  Integrable.  Under  these  regularity  conditions,  It 

Is  easy  to  see  that 


'c  In  t  1 
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where  j  Is  the  sample  space  of  T  =  •V* 

Summarizing  the  above  discussion,  we  are  seeking  a  rule  £  (t) 
such  that 

(2.3)  /  <t(t)hT«(t)dp(t)  *  y< »  1-1..... k 

I  - 

and  It  maximizes,  among  all  rules  satisfying  (2.3),  the  expression 

(2.4)  l  j  «<(t)hjl,(t)di.(t)  . 

i=l  j  - 

2.2.  A  Locally  Optimal  Rule.  We  now  state  and  prove  the  main 
theorem  of  this  section. 


Theorem  2.1.  Under  all  the  assumptions  stated  previously,  a  rule 
6°(t)  which  satisfies  (2.3)  and  maximizes  (2.4)  among  all  rules  satisfy¬ 
ing  (2.3)  Is  given  by 


(2.5) 


6°(t)  • 


1 

X,  If  ^(t) 


cihT*(t) 


where  and  c^  such  that 

(2.6)  /  «?(t)hT*(t)dw(t)  -  Yi,  1-1 . k  . 

j  - 


Proof.  The  proof  Is  straightforward  by  noting  that  for  any  £ 
satisfying  (2.3)  we  have 


l  /  (d^D-d^^Jthjycy-c^^DJdvd)  >  0. 

T  —  — 


(2.7) 


3.  SPECIAL  CASES 


In  this  section,  we  apply  the  result  of  Section  2  to  several  special 
cases. 


Case  A:  Normal  Means  Comparison;  Common  Known  Variance.  Here  ir. 

2  2 

is  N^.oq),  where  og  is  known.  Let  be  the  sample  mean  based  on  n^ 
independent  observations  from  ir^,  i=0,l,...,k.  Take  T^g  =  (^-Xg)/og 
and  t^g  =  (e^-9g)/og.  Of  course,  t*  =  0.  Then  the  joint  density 
hT(t)  of  T  =  (T^g,... ,Tkg)  is  a  multivariate  normal  distribution  with 
mean  vector  t_  =  (^g....."^)  and  covariance  matrix  A  *  U^j)  given  by 


(3.1) 


_1_ 

n0 


for  i  *  j 


for  i  f  j 


where  a1  =  n^ng,  1*1,..., k.  Thus 

(3,2)  hL(t)  *  (2ir)-k/2|Er1/2exp[-i(t-l),A‘1(t-t)]. 

It  is  easy  to  verify  that  h|^(t)/hT*(t) 

■  (A'1t)i 


where  (A_1t)j  denotes  the  1th  coordinate  of  the  vector  (A-1t)  and  in  sum¬ 
mations  Involving  the  a.  such  as  7  a.,  the  subscript  ranges  from  1  to 

0 

k  subject  to  any  exceptions  stated.  The  locally  optimal  rule  fr  Is  now 
given  by 
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«®<t) 


1  if  *i  “  TT 


(3.3) 

l 

0  otherwise 
where  cf  Is  determined  by 

(3.4) 


& 


l  aj‘j  i  c? 


v<T,'i7ir57  ^ajTjic?,'Ti  ■ 

Now,  using  the  fact  that,  when  t  =  ^*(=0),  (A-1T).j  is  normally  distribut¬ 
ed  with  mean  zero  and  variance  nna4(l+  T 

0  1  ih 


a£)/(l+zaA),  it  is  easy  to 


see  that 
(3.5) 


c*  =  *"1(l-yi)/(l+zai)/n0ai(l+  ~J  a^) 


=  *'10-Y1)/N7niTN-nV) 


where  N  =  nQ  +  n1  +  ...  +  nk  and  *  denotes  the  standard  normal  distribu¬ 
tion  function. 


Remark  3.1.  The  individual  selection  probability  «9 ( t_)  in  (3.3) 
can  also  be  expressed  as 


if 


-N-n,  }l}  "j‘j  i  C1 
J/1 


\  0  otherwise 

In  this  form,  it  can  be  recognized  as  a  weighted  average  type  rule. 

When  the  sample  sizes  are  equal,  it  Is  the  usual  average  type  rule  and 

selects  if  and  only  if  t^  -  ^  7  tj  >  cf. 

j.i 


Case  B:  Normal  Means  Comparison;  Conwon  Unknown  Variance.  Unless 

? 

stated  otherwise,  the  notations  of  Case  A  will  apply  here.  Let  denote 
the  sample  variance  (divisor  n^  -  1)  based  on  the  sample  from  Then 
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2  k  a  ^ 

Sp  I  (nj-l)Sj/  l  (riQ-1 )  Is  the  usual  pooled  estimator  of  the  common 

unknown  variance  a2  on  v  *  N  -  k  -  1  degrees  of  freedom.  Define 
Y..  =  (XrXQ)  and  T^q  *  Y^/Sp»  1Bl«...*k.  Then  ® 

multivariate  normal  t  distribution  and  Its  density  hT(t)  can  be  written 
In  an  integral  form  as  follows: 

(3.6)  hT(t)  =  Aq  /  e  /  v  /v  2  w  2  dw  , 

where  t_  =  (-r10,...,Tk0),  T10  *  (0i"0O^0’  AQ  is  the  appropri¬ 

ate  constant,  and  a  =  (x^)  Is  the  same  as  in  Case  A  given  by  (3.1). 


Now,  letting  Cv  k  =  fl.  r (v*i^-)/r(^-)  ,  it  is  easy  to  verify 
^i}(t)/hT*(t)  =  Cv>k(A"1t)1// v+t'A^t 
)tk(A'V)i//vSp+y.'A4)r 


that 


*  C 


v,k 


yr  s=ht  X  Yd 
jl} 


vSp+£'A"  Z 


nj(N-nj) 


where  y  =  (ylf..'.,yk). 

Thus,  the  locally  optimal  rule  is  given  by 


Cv,k  N  '  ,*'^*sp)»  sa*» 


(3.7) 


S?(*.Sp) 


1  if  +(Z»sp)  >  c* 
0  otherwise 


where  c^  Is  determined  by 
(3.8) 


P^WLS^iCf)  •  Yi 


Since  Yi  -  jpp  1  n^Yj  has  a  normal  distribution  with  mean  zero  and 
1  jf  1 


9 


variance  o^N/n^N-n^)  and  (vSp+Y'A _1Y.)/o^  has  a  chi-square  distribution 
with  N  -  1  degrees  of  freedom,  it  follows  that  <HY.»Sp)/ y^n.TN-n~7  ^as 

a  t-distribution  with  N  -  1  degrees  of  freedom.  So  we  see  from  (3.8) 
that 

(3‘9)  Ci  =  Sj.N-l  /n.(N-ni) 

where  t^  N1  denotes  the  upper  IOOy..  percent  point  of  the  t-distribution 
with  N  -  1  degrees  of  freedom. 


Case  C:  Gamma  Scale  Parameters  Comparison:  Unequal  (Known)  Shape 
Parameters.  Let  (i=0,...,k)  be  a  gamma  population  with  density 

V1 

(3.10)  f(x;ej,V|)  = — - — —  exp{-x/e.},  x>_0,  ei  >0, 

where  the  shape  parameters  v..  are  known.  We  take  t.q  *  e^/Og  so  that 
t*  —  1 .  Let  X^,  be  independent  observations  from  ^  and 

define  T^0  =  i=l,...,k.  The  joint  density  of  T^0,...,Tk0  is 

easily  derived  to  be 


(3.11) 


hT(t)  =  r(M) 


1 _ 

r(nivi) 


k 

where  M  =  j»  . 

i*0  1  1 


X 


"iV1 

li 

n^  I  H 
n0Ti0. 


* 


From  this  we  get 
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(3.12) 


fy^l r 


Mn^t^ 


V  jl,  Vj 


-  Vi 


Mn^x^ 

~R - 

l  n-x. 
j=0  J  3 


n1vi 


Thus  the  locally  optimal  rule  6_  is  given  by 


(3.13)  6V(x0,...,*k)  = 


1  if  n.x,/  l  n.x. 

11  j=0  3  3 


0  otherwise 
where  the  constant  c*  is  determined  by 


c 


* 

i 


(3.14) 


l 

j=0  3  3 


When  t.  =  t*,  n^Xj  has  a  gamma  distribution  with  parameters  e  and  n.v^. 

However,  the  probability  in  (3.14)  is  independent  of  e.  It  is  known 
k 

that  n.X./  T  n.X.  has  a  beta  distribution  with  parameters  n.v.  and 
*  '  j=0  ^  J  •  * 

M  -  n^..,  denoted  by  B^v..  ,M-n.jv^ ).  Thus  c^  is  the  upper  lOOy  percent¬ 
age  point  of  Bfn^v^  .M-n^.)  and  can  be  obtained  from  tables  of  incomplete 
beta  function. 


Remark  3.2.  It  should  be  first  pointed  out  that  the  above  prob¬ 
lem  includes  as  a  special  case  the  problem  of  comapring  normal  vari¬ 
ances  based  on  samples  of  unequal  sizes.  It  also  includes  the  problem 
of  comparing  scale  parameters  of  Weibull  populations  which  has  a  com¬ 
mon  known  shape  parameter  g.  If  X^j,  j=l,...,n^,  are  the  sample  obser¬ 
vations  from  then  we  can  transform  these  by  =  X^j  into  sample 


* 


* 


i 
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observations  from  an  exponential  distribution  with  mean  =  e!?.  Thus 
it  is  a  special  case  of  the  gamma  problem. 

Case  D:  Comparison  of  Regression  Slopes.  Let  denote  a  simple 
linear  regression  model 


(3.15) 


Y  =  a..  +  $,.x  +  e 


2  2 

where  e  ~  N(0,o  )  and  o  is  unknown.  Let  (x..,Y..},  j=l,...,n.,  denote 

*  J  *  J  » 

the  sample  data  from  (3.15).  Define  t<0  =  (s .-&Q)/a,  i=l,...,k.  The 
least  squares  estiamtors  of  a.  and  6..  are  given  by  a..  =  -  §,.x.  and 

5i  ■  VSN  '  Uhere  *»  '  “u'V  ?i  =  "VV  S«i  '  E(X«';i)2,  a"d 

$x  y  =  ^(x.j-x.) (Y.j-7.).  All  the  summations  are  over  j  going  from  1  to  n^ 

2  2 

The  usual  pooled  unbiased  estimator  of  a  is  S  = 
k  ni  k 

I  I  J2/  l  (n.,-2).  It  is  well-known  that  Q  =  vS?/a2 

i=0  j=l  1J  1  1  1J  iSQ  1  P 

has  a  chi-square  distribution  with  v  *  T  (n.-2)  degrees  of  freedom. 

i=0  1 

Define  Z,j  =  b..  -  gg  and  T,.q  =  Z,./Sp,  i=l,...,k.  Then  the  joint  density 
h^(t)  of  T  =  (T]o”’*’TkO^  1S  the  same  as  the  exPressi°n  in  (3.6)  but 
with  xi0  =  (e^BgJ/o  and  A  =  (a^.)  given  by 


(3.16) 


Xij  = 


-4-  (1+  t~)  »  i  =  j 

Sx  1 

x0 


i  t  j 


where 


,2  /c2 


i,  *  S‘  /St  ,  i»l,...,k.  It  is  now  easy  to  see  that  the  locally 
1  xi  xo 


optimal  rule  6°  is  given  by 
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(3.17) 


(1  If  *(2iS* . )  >  c* 

«?(*)  «  *0  k 


0  otherwise 


where  z  =  (Z|,...,zk) 


(3.18) 


.....S*  )  = 
x0  xk 


i/i  xj  J 


S  +  7  S4 
x0  .iri  > 

'vSp+z'A-1? 


The  constant  is  determined  by 


(3.19) 


PT*(+(?iS^ . S*  )  >C*)  *  y. 

i  x0  xk  1  1 


and  by  appealing  to  the  result  of  Case  B,  we  get 


(3.20) 


ci  =  tyi  ,N-k-2  T~ 


r+  V  s2 

X0  xj 


—  *■ 
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